Search CORE

40 research outputs found

Data-efficient learning of feedback policies from image pixels using deep dynamical models

Author: Assael J-AM
Deisenroth MP
Schön TB
Wahlström N
Publication venue
Publication date: 08/10/2015
Field of study

Data-efficient reinforcement learning (RL) in continuous state-action spaces using very high-dimensional observations remains a key challenge in developing fully autonomous systems. We consider a particularly important instance of this challenge, the pixels-to-torques problem, where an RL agent learns a closed-loop control policy ( torques ) from pixel information only. We introduce a data-efficient, model-based reinforcement learning algorithm that learns such a closed-loop policy directly from pixel information. The key ingredient is a deep dynamical model for learning a low-dimensional feature embedding of images jointly with a predictive model in this low-dimensional feature space. Joint learning is crucial for long-term predictions, which lie at the core of the adaptive nonlinear model predictive control strategy that we use for closed-loop control. Compared to state-of-the-art RL methods for continuous states and actions, our approach learns quickly, scales to high-dimensional state spaces, is lightweight and an important step toward fully autonomous end-to-end learning from pixels to torques

arXiv.org e-Print Archive

Spiral - Imperial College Digital Repository

Recommended from our members

Divide-and-Conquer With Sequential Monte Carlo

Author: Aston JAD
Bouchard-Côté A
Johansen AM
Kirkpatrick B
Lindsten F
Naesseth CA
Schön TB
Publication venue: Journal of Computational and Graphical Statistics
Publication date: 03/04/2017
Field of study

Apollo (Cambridge)

Recognizing recurrent neural networks (rRNN): Bayesian inference for recurrent neural networks

Author: A Doucet
A Lazar
A Parlos
AH Jazwinski
B Cessac
C Archambeau
C Summerfield
D Debanne
D Mottet
D Perdikis
D Perdikis
D Verstraeten
DV Buonomano
EA Wan
EA Wan
EK Miller
EK Pissadaki
FH Hamker
G Schöner
GE Hinton
H Jaeger
J Daunizeau
J Ting-Ho Lo
JAS Kelso
JL Elman
JT Connor
K Friston
K Friston
K Narendra
K Sidiropoulou
KJ Friston
KJ Friston
KJ Friston
M Bar
M Boerlin
MI Rabinovich
N Spruston
R Blake
R Legenstein
R Wilson
RC Sotero
RP Rao
RP Rao
RPN Rao
S Denève
S Denève
S Rodrigues
S Roweis
Sebastian Bitzer
SJ Kiebel
SJ Kiebel
SJ Kiebel
SJ Kiebel
Stefan J. Kiebel
TB Schön
V Wassenhove van
VK Jirsa
W Maass
Z Ghahramani
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Recurrent neural networks (RNNs) are widely used in computational neuroscience and machine learning applications. In an RNN, each neuron computes its output as a nonlinear function of its integrated input. While the importance of RNNs, especially as models of brain processing, is undisputed, it is also widely acknowledged that the computations in standard RNN models may be an over-simplification of what real neuronal networks compute. Here, we suggest that the RNN approach may be made both neurobiologically more plausible and computationally more powerful by its fusion with Bayesian inference techniques for nonlinear dynamical systems. In this scheme, we use an RNN as a generative model of dynamic input caused by the environment, e.g. of speech or kinematics. Given this generative RNN model, we derive Bayesian update equations that can decode its output. Critically, these updates define a 'recognizing RNN' (rRNN), in which neurons compute and exchange prediction and prediction error messages. The rRNN has several desirable features that a conventional RNN does not have, for example, fast decoding of dynamic stimuli and robustness to initial conditions and noise. Furthermore, it implements a predictive coding scheme for dynamic inputs. We suggest that the Bayesian inversion of recurrent neural networks may be useful both as a model of brain function and as a machine learning tool. We illustrate the use of the rRNN by an application to the online decoding (i.e. recognition) of human kinematics

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

MPG.PuRe

Extended Kalman Filter for Estimation of Parameters in Nonlinear State-Space Models of Biochemical Networks

Author: A Doucet
A Germani
A Sitz
AW Natal
BN Kholodenko
CG Moles
D Henderson
D Simon
DC Clarke
E Klipp
G Riddihough
Gustavo Stolovitzky
GV HarshaRani
HR Künsch
I Arisi
I Nachman
I Swameye
IC Chou
J Timmer
JL Snoep
K Ogata
KH Cho
L Lang
L Ljung
Li Jin
M Quach
M Rodriguez-Fernandez
M Sugimoto
Momiao Xiong
N Jamshidi
P Li
PJ Costa
R Steuer
S Bader
S Borger
S Grimbs
SA Sisson
SJ Julier
T Kisseleva
T Kitayama
T Maiwald
T Tian
TA Nelson
TB Schön
ViolinD
W Kolch
Xiaodian Sun
XL Hu
Y Yu
Z Ghahramani
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

It is system dynamics that determines the function of cells, tissues and organisms. To develop mathematical models and estimate their parameters are an essential issue for studying dynamic behaviors of biological systems which include metabolic networks, genetic regulatory networks and signal transduction pathways, under perturbation of external stimuli. In general, biological dynamic systems are partially observed. Therefore, a natural way to model dynamic biological systems is to employ nonlinear state-space equations. Although statistical methods for parameter estimation of linear models in biological dynamic systems have been developed intensively in the recent years, the estimation of both states and parameters of nonlinear dynamic systems remains a challenging task. In this report, we apply extended Kalman Filter (EKF) to the estimation of both states and parameters of nonlinear state-space models. To evaluate the performance of the EKF for parameter estimation, we apply the EKF to a simulation dataset and two real datasets: JAK-STAT signal transduction pathway and Ras/Raf/MEK/ERK signaling transduction pathways datasets. The preliminary results show that EKF can accurately estimate the parameters and predict states in nonlinear state-space equations for modeling dynamic biochemical networks

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

DigitalCommons@The Texas Medical Center

Using inertial sensors for position and orientation estimation

Author: Hol JD
Kok M
Schön TB
Publication venue
Publication date: 01/01/2017
Field of study

In recent years, microelectromechanical system (MEMS) inertial sensors (3D accelerometers and 3D gyroscopes) have become widely available due to their small size and low cost. Inertial sensor measurements are obtained at high sampling rates and can be integrated to obtain position and orientation information. These estimates are accurate on a short time scale, but suffer from integration drift over longer time scales. To overcome this issue, inertial sensors are typically combined with additional sensors and models. In this tutorial we focus on the signal processing aspects of position and orientation estimation using inertial sensors. We discuss different modeling choices and a selected number of important algorithms. The algorithms include optimization-based smoothing and filtering as well as computationally cheaper extended Kalman filter and complementary filter implementations. The quality of their estimates is illustrated using both experimental and simulated data

CUED - Cambridge University Engineering Department

Identification of Gaussian process state-space models with particle stochastic approximation EM

Author: Frigola R
Lindsten F
Rasmussen CE
Schön TB
Publication venue
Publication date: 17/12/2013
Field of study

Gaussian process state-space models (GP-SSMs) are a very flexible family of models of nonlinear dynamical systems. They comprise a Bayesian nonparametric representation of the dynamics of the system and additional (hyper-)parameters governing the properties of this nonparametric representation. The Bayesian formalism enables systematic reasoning about the uncertainty in the system dynamics. We present an approach to maximum likelihood identification of the parameters in GP-SSMs, while retaining the full nonparametric description of the dynamics. The method is based on a stochastic approximation version of the EM algorithm that employs recent developments in particle Markov chain Monte Carlo for efficient identification

arXiv.org e-Print Archive

CiteSeerX

Publikationer från Linköpings universitet

Crossref

Digitala Vetenskapliga Arkivet - Academic Archive On-line

CUED - Cambridge University Engineering Department

Bayesian inference and learning in Gaussian process state-space models with Particle MCMC

Author: Frigola R
Lindsten F
Rasmussen CE
Schön TB
Publication venue
Publication date: 01/01/2013
Field of study

State-space models are successfully used in many areas of science, engineering and economics to model time series and dynamical systems. We present a fully Bayesian approach to inference and learning (i.e. state estimation and system identification) in nonlinear nonparametric state-space models. We place a Gaussian process prior over the state transition dynamics, resulting in a flexible model able to capture complex dynamical phenomena. To enable efficient inference, we marginalize over the transition dynamics function and, instead, infer directly the joint smoothing distribution using specially tailored Particle Markov Chain Monte Carlo samplers. Once a sample from the smoothing distribution is computed, the state transition predictive distribution can be formulated analytically. Our approach preserves the full nonparametric expressivity of the model and can make use of sparse Gaussian processes to greatly reduce computational complexity

CiteSeerX

CUED - Cambridge University Engineering Department

Rao-Blackwellized Particle Smoothers for Conditionally Linear Gaussian Models

Author: Bunch P
Godsill SJ
Lindsten F
Schön TB
Särkkä S
Publication venue
Publication date: 01/03/2016
Field of study

Sequential Monte Carlo (SMC) methods, such as the particle filter, are by now one of the standard computational techniques for addressing the filtering problem in general state-space models. However, many applications require post-processing of data offline. In such scenarios the smoothing problem - in which all the available data is used to compute state estimates - is of central interest. We consider the smoothing problem for a class of conditionally linear Gaussian models. We present a forward-backward-type Rao-Blackwellized particle smoother (RBPS) that is able to exploit the tractable substructure present in these models. Akin to the well known Rao-Blackwellized particle filter, the proposed RBPS marginalizes out a conditionally tractable subset of state variables, effectively making use of SMC only for the 'intractable part' of the model. Compared to existing RBPS, two key features of the proposed method are: 1) it does not require structural approximations of the model, and 2) the aforementioned marginalization is done both in the forward direction and in the backward direction

CUED - Cambridge University Engineering Department

Perioperative Antibiotikaprophylaxe bei der radikalen retropubischen Prostatektomie

Author: AB Kaiser
G. Schön
JW Lumley
N. Tawassoli
OS Nielsen
TB Hargreave
WK Mebust
WT Morris
WT Morris
WT Morris
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Nonlinear System Identification Using Particle Filters

Author: A Dempster
A Doucet
C Andrieu
CE Rasmussen
CP Robert
F Lindsten
F Lindsten
G Poyiadjis
NJ Gordon
O Cappé
O Cappé
TB Schön
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref